Metric recovery from directed unweighted graphs

Authors

  • Tatsunori B. Hashimoto
  • Yi Sun
  • Tommi S. Jaakkola
Abstract

We analyze directed, unweighted graphs obtained from x_i ∈ R^d by connecting vertex i to j iff |x_i − x_j| < ε(x_i). Examples of such graphs include k-nearest neighbor graphs, where ε(x_i) varies from point to point, and, arguably, many real-world graphs such as co-purchasing graphs. We ask whether we can recover the underlying Euclidean metric ε(x_i) and the associated density p(x_i) given only the directed graph and d. We show that consistent recovery is possible up to isometric scaling when the vertex degree is at least ω(n^{2/(2+d)} log(n)^{d/(d+2)}). Our estimator is based on a careful characterization of a random walk over the directed graph and the associated continuum limit. As an algorithm, it resembles the PageRank centrality metric. We demonstrate empirically that the estimator performs well on simulated examples as well as on real-world co-purchasing graphs even with a small number of points and degree scaling as low as log(n).
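The graph construction described in the abstract can be illustrated with a minimal sketch. It builds a directed k-nearest neighbor graph in which vertex i points to j iff the distance from point i to point j is below a per-point radius ε, and then runs a generic damped PageRank power iteration over it. The function names, the choice of ε as the midpoint between the k-th and (k+1)-th neighbor distances, and the damping value 0.85 are assumptions made for illustration; the PageRank step only stands in for the kind of random walk the abstract mentions and is not the paper's estimator.

```python
import numpy as np

def knn_digraph(x, k):
    """Directed kNN graph: adj[i, j] is True iff |x_i - x_j| < eps[i].

    eps[i] is chosen (an assumption of this sketch) as the midpoint between
    the k-th and (k+1)-th nearest-neighbor distances of point i, so that the
    strict inequality selects exactly k out-neighbors. Requires k <= n - 2.
    """
    dist = np.linalg.norm(x[:, None, :] - x[None, :, :], axis=-1)
    np.fill_diagonal(dist, np.inf)            # exclude self-loops
    sorted_dist = np.sort(dist, axis=1)       # per-row ascending distances
    eps = 0.5 * (sorted_dist[:, k - 1] + sorted_dist[:, k])
    adj = dist < eps[:, None]
    return adj, eps

def pagerank(adj, damping=0.85, n_iter=200):
    """Generic damped PageRank by power iteration (not the paper's estimator)."""
    n = adj.shape[0]
    # Row-normalize; every row has k > 0 out-edges, so no division by zero.
    transition = adj / adj.sum(axis=1, keepdims=True)
    rank = np.full(n, 1.0 / n)
    for _ in range(n_iter):
        rank = damping * (rank @ transition) + (1.0 - damping) / n
    return rank

# Small simulated example: 200 Gaussian points in the plane, k = 10.
rng = np.random.default_rng(0)
points = rng.normal(size=(200, 2))
adj, eps = knn_digraph(points, k=10)
rank = pagerank(adj)
```

Every vertex has out-degree k by construction, while in-degrees differ from vertex to vertex, which is why the graph is directed: a point in a sparse region reaches far to find its k neighbors, but those neighbors need not point back.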

Related resources

From random walks to distances on unweighted graphs

Large unweighted directed graphs are commonly used to capture relations between entities. A fundamental problem in the analysis of such networks is to properly define the similarity or dissimilarity between any two vertices. Despite the significance of this problem, statistical characterization of the proposed metrics has been limited. We introduce and develop a class of techniques for analyzin...

New Parameterized Algorithms for APSP in Directed Graphs

All Pairs Shortest Path (APSP) is a classic problem in graph theory. While for general weighted graphs there is no algorithm that computes APSP in O(n^{3−ε}) time (ε > 0), by using fast matrix multiplication algorithms, we can compute APSP in O(n^ω log n) time (ω < 2.373) for undirected unweighted graphs, and in O(n^{2.5302}) time for directed unweighted graphs. In the current state of matters, there is...

A Constant-Factor Approximation Algorithm for Embedding Unweighted Graphs into Trees

We present a constant-factor approximation algorithm for computing an embedding of the shortest path metric of an unweighted graph into a tree, that minimizes the multiplicative distortion.

On the complexity of the multicut problem in bounded tree-width graphs and digraphs

Given an edge- or vertex-weighted graph or digraph and a list of source-sink pairs, the minimum multicut problem consists in selecting a minimum weight set of edges or vertices whose removal leaves no path from each source to the corresponding sink. This is a classical NP-hard problem, and we show that the edge version becomes tractable in bounded tree-width graphs if the number of source-sink pai...

The Directed Minimum Latency Problem

We study the directed minimum latency problem: given an n-vertex asymmetric metric (V, d) with a root vertex r ∈ V, find a spanning path originating at r that minimizes the sum of latencies at all vertices (the latency of any vertex v ∈ V is the distance from r to v along the path). This problem has been well-studied on symmetric metrics, and the best known approximation guarantee is 3.59 [3]....

Journal:
  • CoRR

Volume abs/1411.5720

Publication date: 2015